Generating a Word-Emotion Lexicon from #Emotional Tweets
Authors
Abstract
Research in emotion analysis of text suggests that emotion-lexicon-based features are superior to corpus-based n-gram features. However, the static nature of general-purpose emotion lexicons makes them less suited to social media analysis, where the need to adapt to changes in vocabulary usage and context is crucial. In this paper, we propose a set of methods to extract a word-emotion lexicon automatically from an emotion-labelled corpus of tweets. Our results confirm that the features derived from these lexicons outperform the standard bag-of-words features when applied to an emotion classification task. Furthermore, a comparative analysis with both manually crafted lexicons and a state-of-the-art lexicon generated using Point-Wise Mutual Information shows that the lexicons generated from the proposed methods lead to significantly better classification performance.
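For orientation, the Point-Wise Mutual Information baseline named in the abstract can be sketched roughly as follows. This is a minimal illustration only: it is not the authors' proposed methods, nor the exact state-of-the-art lexicon they compare against. The function name `pmi_lexicon`, the whitespace tokenisation, and the toy tweets are all assumptions made for the example.

```python
import math
from collections import Counter, defaultdict


def pmi_lexicon(labelled_tweets):
    """Build {word: {emotion: PMI score}} from (text, emotion) pairs.

    Hypothetical helper: PMI(w, e) = log2( P(w, e) / (P(w) * P(e)) ),
    with probabilities estimated from token counts.
    """
    joint = Counter()          # count of (word, emotion) co-occurrences
    word_total = Counter()     # count of each word over all tweets
    emotion_total = Counter()  # count of tokens seen under each emotion
    n_tokens = 0

    for text, emotion in labelled_tweets:
        # Assumption: naive lowercase whitespace tokenisation.
        for word in text.lower().split():
            joint[(word, emotion)] += 1
            word_total[word] += 1
            emotion_total[emotion] += 1
            n_tokens += 1

    lexicon = defaultdict(dict)
    for (word, emotion), count in joint.items():
        ratio = count * n_tokens / (word_total[word] * emotion_total[emotion])
        lexicon[word][emotion] = math.log2(ratio)
    return dict(lexicon)


# Toy, made-up corpus purely for illustration.
tweets = [
    ("happy sunny day", "joy"),
    ("sad rainy day", "sadness"),
]
lexicon = pmi_lexicon(tweets)
print(lexicon["happy"])
print(lexicon["day"])
```

Words that occur only with one emotion receive a positive score for it, while words spread evenly across emotions score near zero. Pairs that never co-occur are simply absent from the output rather than assigned negative infinity.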
Similar resources
#Emotional Tweets
Detecting emotions in microblogs and social media posts has applications for industry, health, and security. However, there exists no microblog corpus with instances labeled for emotions for developing supervised systems. In this paper, we describe how we created such a corpus from Twitter posts using emotion-word hashtags. We conduct experiments to show that the self-labeled hashtag annotations...
Using Hashtags to Capture Fine Emotion Categories from Tweets
Detecting emotions in microblogs and social media posts has applications for industry, health, and security. Statistical, supervised automatic methods for emotion detection rely on text that is labeled for emotions, but such data is rare and available for only a handful of basic emotions. In this paper, we show that emotion-word hashtags are good manual labels of emotions in tweets. We also pro...
Crowdsourcing-based Annotation of Emotions in Filipino and English Tweets
The automatic analysis of emotions conveyed in social media content, e.g., tweets, has many beneficial applications. In the Philippines, one of the most disaster-prone countries in the world, such methods could potentially enable first responders to make timely decisions despite the risk of data deluge. However, recognising emotions expressed in Philippine-generated tweets, which are mostly wri...
NSEmo at EmoInt-2017: An Ensemble to Predict Emotion Intensity in Tweets
In this paper, we describe a method to predict emotion intensity in tweets. Our approach is an ensemble of three regression methods. The first method uses content-based features (hashtags, emoticons, elongated words, etc.). The second method considers word n-grams and character n-grams for training. The final method uses lexicons, word embeddings, word n-grams, character n-grams for training the m...
Root-Word Analysis of Turkish Emotional Language
This paper describes a model for the perceived emotion of Turkish sentences based on the emotions associated with the constituent words. In our model, each emotion is mapped to a point in the continuous space defined by three emotional attributes: valence, activation, and dominance. We collected a large data set through two independent surveys: a word-level survey that prompted users with emoti...